1.  Introduction and Summary.

Suppose $X_1, \dots, X_n$ are independently distributed according to
$N(\mu_1, \sigma_1^2), \dots, N(\mu_n, \sigma_n^2)$, respectively. Does the
distribution of the maximum of $X_1, \dots, X_n$ uniquely determine
$(\mu_1, \sigma_1^2), \dots, (\mu_n, \sigma_n^2)$?

In other terms, can more than one set of independent normal variables
give rise to the same distribution of the random variable which is the
maximum of the constituent random variables? Our paper answers this
question; in fact, the relevant theorem answers this question about
identification of parameters on the basis of the maximum of a set of
random variables for a more general family of parent distributions.

Our attention was called to this problem by an inquiry from an
econometrician (Professor Takeshi Amemiya, Department of Economics,
Stanford University). In an econometric model $X_1$ is defined as the
quantity of a commodity which consumers would be willing to purchase
under specified circumstances including the price of a unit, and $X_2$
is the quantity that would be sold by the producers under certain
conditions. A simple stochastic model specifies the demand and supply
schedules as

(1.1)  $X_1 = \alpha_1 + \beta_1 p + u_1$,

(1.2)  $X_2 = \alpha_2 + \beta_2 p + u_2$,

where $p$ is the price of the commodity and $u_1$ and $u_2$ are
independent (unobservable) normal random variables with means 0 and
variances $\sigma_1^2$ and $\sigma_2^2$, respectively. If the price is set
independently of the market (for example, by an outside agency or by
custom), then the quantity actually sold by the producers to the
consumers at a given price $p$ is the minimum of $X_1$ and $X_2$ given by
(1.1) and (1.2). Since the econometrician wishes to determine the demand
and supply schedules, he raises the question of whether observations on
the quantity passing from producers to consumers at various prices
determine the parameters of the model: the intercepts $\alpha_1$ and
$\alpha_2$, the slopes $\beta_1$ and $\beta_2$, and the variances
$\sigma_1^2$ and $\sigma_2^2$. Since the normal distribution is symmetric,
the question is mathematically equivalent to the question posed in terms
of the maximum of $X_1$ and $X_2$. (In the more customary equilibrium
model there is a third equation equating demand to supply, $X_1 = X_2$.
This model, in which $X_1 = X_2$ and $p$ are observable random variables,
represents a market in which the price adjusts so that at that price the
quantity $X_1$ that buyers demand is equal to the quantity $X_2$ that
sellers produce.)
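The equivalence between the minimum and the maximum formulations can be checked numerically. The following is a minimal sketch, not part of the original argument; the function names and the parameter values are hypothetical, and the components are assumed independent and normal as in (1.1) and (1.2) at a fixed price.

```python
import math

def normal_cdf(x, mean, sd):
    """Normal cdf expressed through the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

def cdf_min(q, params):
    """Pr[min(X_1, ..., X_n) <= q] for independent normals given as (mean, sd)."""
    survive = 1.0
    for mean, sd in params:
        survive *= 1.0 - normal_cdf(q, mean, sd)
    return 1.0 - survive

def cdf_max(q, params):
    """Pr[max(X_1, ..., X_n) <= q] for independent normals given as (mean, sd)."""
    prod = 1.0
    for mean, sd in params:
        prod *= normal_cdf(q, mean, sd)
    return prod

# Hypothetical demand and supply distributions at one fixed price.
params = [(10.0, 2.0), (12.0, 1.5)]
# Negating each variable turns the minimum into (minus) a maximum.
negated = [(-m, s) for m, s in params]

q = 9.0
lhs = cdf_min(q, params)
rhs = 1.0 - cdf_max(-q, negated)
print(lhs, rhs)
```

Because the negated variables are again normal with the same variances, any identification result for the maximum carries over to the minimum.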

In  this  paper  we  have  studied  the  question  in  greater  generality 
by  considering  the  maximum  of  an  arbitrary  number  of  random  variables. 
Intuitively, the approach is that if the variance of one of
$X_1, \dots, X_n$ is larger than the variance of the others, the
distribution (or density) of the maximum in the upper tail is similar to
the distribution (or density) in the upper tail of the component with
maximum variance. It is therefore convenient for us to prove
identifiability on the basis of a general condition on the upper tails of
the densities in a certain family (Section 2). As applications of this
general theorem we answer the question posed in the first paragraph in
the affirmative and also assure our econometrician that the parameters in
the model are identified.

It is natural to ask in what way this property of identification
carries over to multivariate cases. In the econometric example a set
of  consumers  may  consider  purchasing  a number  of  commodities  and  a set 
of  producers  may  propose  to  sell  these  commodities;  for  each  commodity 
the  quantity  actually  sold  from  producer  to  consumer  at  a given  price 
is  the  minimum  of  the  amount  desired  at  that  price  and  the  amount  offered 
by  the  producers  at  that  price. 

We  study  the  multivariate  case  within  the  framework  of  the  normal 
distribution,  the  parameters  of  which  are  the  means  of  the  variables, 
the  variances,  and  the  correlations  between  the  variables.  The  maximum 
of  the  vector  variable  is  defined  as  the  vector  of  the  maxima  of  the 
respective  elements  of  the  constituent  vector  variables.  Consideration 
of  the  marginal  distribution  of  the  maximum  of  a particular  coordinate 
of  the  constituent  vector  variables  determines  the  set  of  means  and 
variances of that coordinate (by Theorem 2.1). However, the one-to-one
correspondence between the means and variances of two different
coordinates of the constituent vector variables is not available from
study of the several marginal distributions. Moreover, identification of
the correlation coefficients in the distributions of the vector variable
cannot be made from one-dimensional distributions.

In Section 3 we treat the multivariate normal distribution where
all means are zero and all the correlations are nonnegative; in this
case there is identification wherever the correlation is not zero. In
Section 5 we treat the case of two vector random variables; then there
is complete identification. In order to treat this case we develop a
generalization of Mills' ratio, which is of interest in its own right.
This is given in Section 4.
2.  The Univariate Case.


Theorem 2.1. Let $\mathcal{F}$ be a family of probability density
functions $f$ on the real line which are continuous and positive to the
right of some point $A$ and such that if $f$ and $g$ are any two distinct
members of $\mathcal{F}$ then $\lim_{x \to \infty} \{f(x)/g(x)\}$ exists
and equals either 0 or $\infty$. Let $X_1, \dots, X_m$ be independent
random variables with respective pdf's $f_1, \dots, f_m$ in $\mathcal{F}$,
and $Y_1, \dots, Y_n$ be independent random variables with pdf's
$g_1, \dots, g_n \in \mathcal{F}$. If $\max\{X_1, \dots, X_m\}$ and
$\max\{Y_1, \dots, Y_n\}$ have identical distributions, then $m = n$ and
there exists a permutation $\{k_1, \dots, k_m\}$ of $\{1, \dots, m\}$ such
that the pdf of $Y_i$ is $f_{k_i}$, $i = 1, \dots, m$.


Proof. By definition

(2.1)  $\Pr[\max\{X_1, \dots, X_m\} \le x] = \prod_{i=1}^{m} \Pr\{X_i \le x\} = \prod_{i=1}^{m} F_i(x)$,

where $F_i(x) = \int_{-\infty}^{x} f_i(u)\,du$, and similarly

(2.2)  $\Pr[\max\{Y_1, \dots, Y_n\} \le x] = \prod_{i=1}^{n} G_i(x)$,

where $G_i(x) = \int_{-\infty}^{x} g_i(u)\,du$, and $g_1, \dots, g_n \in \mathcal{F}$.
We are given

(2.3)  $\prod_{i=1}^{m} F_i(x) = \prod_{i=1}^{n} G_i(x)$,  $-\infty < x < \infty$.

Hence,

(2.4)  $\sum_{i=1}^{m} \ln F_i(x) = \sum_{i=1}^{n} \ln G_i(x)$.

Differentiating with respect to $x$, we have for all $x > A$

(2.5)  $\sum_{i=1}^{m} \frac{f_i(x)}{F_i(x)} = \sum_{i=1}^{n} \frac{g_i(x)}{G_i(x)}$,

or

(2.6)  $\sum_{i=1}^{m+n} a_i \frac{f_i(x)}{F_i(x)} = 0$,

where $a_i = 1$, $i = 1, \dots, m$, and $a_i = -1$, $i = m+1, \dots, m+n$, and
$f_{m+i} = g_i$, $F_{m+i} = G_i$, $i = 1, 2, \dots, n$.

There exists an $f_i$, say $f_1$, such that
$\lim_{x \to \infty} [f_i(x)/f_1(x)]$ is either 0 or 1 for
$i = 2, \dots, m+n$. Let $I = \{i : f_i(x)/f_1(x) \to 1$ as $x \to \infty\}$.
Then dividing (2.6) by $f_1(x)$ and letting $x \to \infty$ (note that each
$F_i(x) \to 1$), we have $\sum_{i \in I} a_i = 0$, so that $I$ contains an
even number of elements, half of which are from $\{1, \dots, m\}$. Thus
a certain number of $f_i$ in (2.5) are identical and are identical to
the same number of $g_i$. Subtracting these identical terms from both
sides of (2.5), we have a new equation of the same form but with fewer
terms. The process can be iterated until each term on one side of (2.5)
is matched with one term on the other. If $m = n$, the proposition is
proved. On the other hand if $m \ne n$, say $m < n$, we have $n - m$ of
the $g_i$ such that $g_i(x) = 0$ for all $x > A$, contrary to the
assumption about $\mathcal{F}$. Hence, $m = n$. Q.E.D.
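The product form (2.1) is easy to evaluate, and doing so illustrates what the theorem does and does not assert. The sketch below is an illustration under assumed values (the parameter sets and names are hypothetical): permuting the components leaves the cdf of the maximum unchanged, while a genuinely different set of normal components gives a different cdf.

```python
import math

def normal_cdf(x, mean, sd):
    """Normal cdf expressed through the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

def max_cdf(x, params):
    """Right-hand side of (2.1): product of the component cdfs."""
    prod = 1.0
    for mean, sd in params:
        prod *= normal_cdf(x, mean, sd)
    return prod

# Hypothetical parameter sets, each entry is (mean, standard deviation).
set_a = [(0.0, 1.0), (1.0, 2.0)]
set_a_permuted = [(1.0, 2.0), (0.0, 1.0)]
set_b = [(0.5, 1.0), (0.5, 2.0)]

grid = [-3.0 + 0.5 * k for k in range(19)]
same = max(abs(max_cdf(x, set_a) - max_cdf(x, set_a_permuted)) for x in grid)
diff = max(abs(max_cdf(x, set_a) - max_cdf(x, set_b)) for x in grid)
print(same, diff)
```

A numerical comparison on a grid is of course no proof of identifiability; it only shows the kind of statement that Theorem 2.1 makes precise.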
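Example 2.1. The family of normal densities with arbitrary means
and variances satisfies the assumptions of the theorem. In fact, if

(2.7)  $\phi(x \mid \mu, \sigma^2) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[-\frac{(x-\mu)^2}{2\sigma^2}\right]$,

then as $x \to \infty$

(2.8)  $\frac{\phi(x \mid \mu_2, \sigma_2^2)}{\phi(x \mid \mu_1, \sigma_1^2)} \to
\begin{cases}
0 & \text{if } \sigma_1^2 > \sigma_2^2 \text{ or if } \sigma_1^2 = \sigma_2^2 \text{ and } \mu_1 > \mu_2, \\
\infty & \text{if } \sigma_1^2 < \sigma_2^2 \text{ or if } \sigma_1^2 = \sigma_2^2 \text{ and } \mu_1 < \mu_2, \\
1 & \text{if } \sigma_1^2 = \sigma_2^2 \text{ and } \mu_1 = \mu_2.
\end{cases}$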
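The tail behaviour of a ratio of two normal densities can be illustrated by working with log densities, which avoids underflow far out in the tail. This is a minimal sketch with hypothetical parameter values; the ratio is taken with the second density in the numerator, which is an assumption about how the display (2.8) is arranged.

```python
import math

def log_normal_pdf(x, mean, var):
    """Logarithm of the normal density (2.7) with the given mean and variance."""
    return -0.5 * math.log(2.0 * math.pi * var) - (x - mean) ** 2 / (2.0 * var)

def log_ratio(x, p1, p2):
    """log of phi(x | mu_2, sigma_2^2) / phi(x | mu_1, sigma_1^2)."""
    return log_normal_pdf(x, *p2) - log_normal_pdf(x, *p1)

# Case sigma_1^2 > sigma_2^2: the ratio tends to 0 (log ratio to -infinity).
p1 = (0.0, 4.0)   # (mean, variance), hypothetical values
p2 = (3.0, 1.0)
values = [log_ratio(x, p1, p2) for x in (5.0, 10.0, 20.0, 40.0)]

# Case of equal variances and mu_1 < mu_2: the ratio tends to infinity.
equal_var = log_ratio(40.0, (0.0, 1.0), (1.0, 1.0))
print(values, equal_var)
```

The larger variance eventually dominates regardless of the means, which is the intuition stated in Section 1.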


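Example 2.2. Let

(2.9)  $\phi(x \mid \alpha, \beta, \sigma, p) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[-\frac{(x - \alpha - \beta p)^2}{2\sigma^2}\right]$.

Then as $x \to \infty$

(2.10)  $\frac{\phi(x \mid \alpha_2, \beta_2, \sigma_2, p_2)}{\phi(x \mid \alpha_1, \beta_1, \sigma_1, p_1)} \to
\begin{cases}
0 & \text{if } \sigma_1^2 > \sigma_2^2 \text{ or if } \sigma_1^2 = \sigma_2^2 \text{ and } \alpha_1 + \beta_1 p_1 > \alpha_2 + \beta_2 p_2, \\
\infty & \text{if } \sigma_1^2 < \sigma_2^2 \text{ or if } \sigma_1^2 = \sigma_2^2 \text{ and } \alpha_1 + \beta_1 p_1 < \alpha_2 + \beta_2 p_2, \\
1 & \text{if } \sigma_1^2 = \sigma_2^2 \text{ and } \alpha_1 + \beta_1 p_1 = \alpha_2 + \beta_2 p_2.
\end{cases}$

The parameters are identified if at least two of $X_1, \dots, X_n$ have
densities with the same pair $\alpha, \beta$ and different values of $p$.
(In the econometric "equilibrium" model $X_1 = X_2$ and $p$ are random
variables determined as the solution of (1.1) and (1.2); any joint normal
density of these two variables is consistent with various sets of
$\alpha_1$, $\beta_1$, $\alpha_2$, $\beta_2$, $\sigma_1^2$ and $\sigma_2^2$,
and hence the parameters cannot be identified.)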

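Example 2.3. Let $\phi_\lambda(x) = 0$, $x < 0$, $= \lambda e^{-\lambda x}$,
$x \ge 0$. Then as $x \to \infty$

(2.11)  $\frac{\phi_{\lambda_2}(x)}{\phi_{\lambda_1}(x)} \to
\begin{cases}
0 & \text{if } \lambda_1 < \lambda_2, \\
\infty & \text{if } \lambda_1 > \lambda_2, \\
1 & \text{if } \lambda_1 = \lambda_2.
\end{cases}$

There are families of pdf's which do not satisfy the condition
postulated in the theorem but the members of which are identified by
the distribution of the maximum random variable.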

Example 2.4. Let

(2.12)  $f_a(x) = \begin{cases} e^{-(x-a)}, & x \ge a, \\ 0, & x < a. \end{cases}$

If $f_i = f_{a_i}$, $i = 1, \dots, m$, and $g_i = f_{b_i}$, $i = 1, \dots, n$, then

(2.13)  $\Pr[\max\{X_i,\ i = 1, \dots, m\} \le \min\{a_i,\ i = 1, \dots, m\}] = 0$

implies

(2.14)  $\min\{a_1, \dots, a_m\} = \min\{b_1, \dots, b_n\}$.

Further, the form of the cdf of the maximum for values of $x$ between
the smallest and second smallest distinct $a_i$ tells us that the number
of $a_i = \min\{a_i\}$ is the same as the number of $b_i = \min\{b_i\}$, and
also that the second smallest $a_i$ = the second smallest $b_i$. Proceeding
this way, we reach the conclusion that $m = n$ and $\{a_1, \dots, a_m\}$ is
a permutation of $\{b_1, \dots, b_m\}$.

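Example 2.5. Let

(2.15)  $f_a(x) = \tfrac{1}{2} e^{-|x-a|}$.

In this case, if $f_i = f_{a_i}$ and $g_i = f_{b_i}$, then for all sufficiently
large $x$, equation (2.3) becomes

(2.16)  $\prod_{i=1}^{m} \left(1 - \tfrac{1}{2} e^{a_i} e^{-x}\right) = \prod_{i=1}^{n} \left(1 - \tfrac{1}{2} e^{b_i} e^{-x}\right)$.

Without loss of generality, we may assume $m \le n$, write $z = e^{x}$, and
multiply both sides of (2.16) by $z^{n}$, giving

(2.17)  $z^{n-m} \prod_{i=1}^{m} \left(z - \tfrac{1}{2} e^{a_i}\right) = \prod_{i=1}^{n} \left(z - \tfrac{1}{2} e^{b_i}\right)$.

Equating the zeros of the two polynomials yields the conclusion that
$m = n$ and $\{a_1, \dots, a_m\}$ is a permutation of $\{b_1, \dots, b_m\}$.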
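The polynomial argument of Example 2.5 can be traced numerically. The sketch below is illustrative only: the location values are hypothetical, and it assumes the double exponential density $\tfrac12 e^{-|x-a|}$, whose cdf for $x \ge a$ is $1 - \tfrac12 e^{-(x-a)}$. It checks that the product of cdfs equals the polynomial in $z = e^{x}$ divided by $z^{m}$, and that the zeros $\tfrac12 e^{a_i}$ return the locations.

```python
import math

def laplace_cdf(x, a):
    """cdf of the density (1/2) exp(-|x - a|)."""
    if x >= a:
        return 1.0 - 0.5 * math.exp(-(x - a))
    return 0.5 * math.exp(x - a)

def locations_from_zeros(zeros):
    """Invert z_i = (1/2) exp(a_i) to recover the location parameters."""
    return sorted(math.log(2.0 * z) for z in zeros)

# Hypothetical location parameters a_1, ..., a_m.
a = [0.3, -1.2, 2.0]
zeros = [0.5 * math.exp(ai) for ai in a]

# For x larger than every a_i the cdf of the maximum is a polynomial in z = e^x
# divided by z^m.
x = 5.0
z = math.exp(x)
cdf_product = math.prod(laplace_cdf(x, ai) for ai in a)
poly_value = math.prod(z - zi for zi in zeros) / z ** len(a)

recovered = locations_from_zeros(zeros)
print(cdf_product, poly_value, recovered)
```

Since a polynomial is determined by its zeros and leading coefficient, two parameter sets giving the same cdf for large $x$ must share the same multiset of zeros, hence the same locations.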
There are also examples of a family of densities whose elements
are not identified by the distribution of the maximum. For example,
let

(2.18)  $f_a(x) = \begin{cases} a e^{a x}, & x \le 0, \\ 0, & x > 0, \end{cases}$

and $\mathcal{F} = \{f_a : a > 0\}$. If $f_i = f_{a_i}$, $i = 1, \dots, m$, the cdf of
$\max\{X_i\}$ is

(2.19)  $\begin{cases} \exp\left(x \sum_{i=1}^{m} a_i\right), & x \le 0, \\ 1, & x > 0, \end{cases}$

and the only inference from (2.3) is

$\sum_{i=1}^{m} a_i = \sum_{i=1}^{n} b_i$.
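The failure of identification in this family is easy to exhibit. The following sketch assumes, as the product form (2.19) suggests, that each component has cdf $e^{a_i x}$ for $x \le 0$; the rate values are hypothetical. Three different parameter sets with the same sum produce the same cdf of the maximum, and even the number of components is not determined.

```python
import math

def max_cdf(x, rates):
    """cdf of the maximum when each X_i has cdf exp(a_i * x) for x <= 0."""
    if x > 0:
        return 1.0
    return math.prod(math.exp(a * x) for a in rates)

# Hypothetical parameter sets, all with sum 4.
rates_1 = [1.0, 3.0]
rates_2 = [2.0, 2.0]
rates_3 = [4.0]

grid = [-5.0 + 0.25 * k for k in range(25)]
gap_12 = max(abs(max_cdf(x, rates_1) - max_cdf(x, rates_2)) for x in grid)
gap_13 = max(abs(max_cdf(x, rates_1) - max_cdf(x, rates_3)) for x in grid)
print(gap_12, gap_13)
```

Here every ratio $f_a(x)/f_b(x)$ is undefined to the right of 0, so the tail condition of Theorem 2.1 cannot be used to separate the components.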
3.  A Special Multivariate Normal Case.

Suppose the vector $X_i = (X_{1i}, \dots, X_{pi})'$ has the distribution
$N(\mu_i, \Sigma_i)$, $i = 1, \dots, n$. Let $Y_j = \max(X_{j1}, \dots, X_{jn})$
and let $Y = (Y_1, \dots, Y_p)'$. Does the distribution of $Y$ determine the
parameters $\mu_1, \Sigma_1, \dots, \mu_n, \Sigma_n$? Consideration of the
marginal distribution shows that the distribution of $Y_j$ determines the
values of the pairs $(\mu_{j1}, \sigma_{j1}^2), \dots, (\mu_{jn}, \sigma_{jn}^2)$
for each $j$. (The $j,k$-th element of $\Sigma_i$ is
$\sigma_{ji} \sigma_{ki} \rho_{jki}$.) But the marginal distributions do not
lead to correspondences between the different values of $j$. However, if
bivariate distributions are identified, then the correspondences for
$j = 1, \dots, p$ can be made. Accordingly we turn our attention to
bivariate distributions.

We consider an arbitrary number $n$ of bivariate normal distributions
but restrict them so as to minimize the kind of tedious computations
which we are not able to avoid in Section 5 in the detailed discussion
of the case $n = 2$.

Theorem 3.1. If $\Phi_1, \dots, \Phi_n$, $F_1, \dots, F_m$ are nonsingular
bivariate normal cdf's with zero means and the correlations in
$\Phi_1, \dots, \Phi_n$ are all nonnegative, then

(3.1)  $\prod_{i=1}^{n} \Phi_i(x, y) = \prod_{i=1}^{m} F_i(x, y)$

implies that $m = n$, there are no $F_i$ with negative correlations, the
number of zero correlations is the same on both sides of (3.1), and the
$F_i$ with positive correlations can be identified one-for-one with the
$\Phi_i$ having positive correlations.
Proof. By letting one variable go to infinity in the identity
(3.1), we see from the one-dimensional result of Example 2.1 that the
set of x-variances on the l.h.s. of (3.1) is a permutation of that on
the r.h.s.; and the same is true of the y-variances. Also, $m = n$.
Therefore, if the x-variance, correlation and the y-variance in $\Phi_i$
are respectively $\sigma_i^2$, $\rho_i$, $\tau_i^2$, then we may assume, without
loss of generality, that the corresponding parameters for $F_i$ are
$\sigma_i^2$, $r_i$, $t_i^2$, where $(t_1, \dots, t_n)$ is a permutation of
$(\tau_1, \dots, \tau_n)$.

Taking logarithms in (3.1) and differentiating with respect to $x$
we get

(3.2)  $\sum_{i=1}^{n} \frac{1}{\sigma_i}\, n\!\left(\frac{x}{\sigma_i}\right) N\!\left[\left(\frac{y}{\tau_i} - \frac{\rho_i x}{\sigma_i}\right)\Big/\sqrt{1-\rho_i^2}\right] \Big/ \Phi_i(x, y)
= \sum_{i=1}^{n} \frac{1}{\sigma_i}\, n\!\left(\frac{x}{\sigma_i}\right) N\!\left[\left(\frac{y}{t_i} - \frac{r_i x}{\sigma_i}\right)\Big/\sqrt{1-r_i^2}\right] \Big/ F_i(x, y)$,

where $n(\cdot)$ and $N(\cdot)$ are, respectively, the one-dimensional standard
normal pdf and cdf.

Now, let $\sigma = \max\{\sigma_i,\ i = 1, \dots, n\}$, $I = \{i : \sigma_i = \sigma\}$,
$I_0(\rho) = \{i : \rho_i = 0,\ i \in I\}$, $I_0(r) = \{i : r_i = 0,\ i \in I\}$. On
dividing (3.2) by $(1/\sigma)\, n(x/\sigma)$ and letting $x \to \infty$, we get

(3.3)  $\sum_{i \in I_0(\rho)} 1 = \sum_{i \in I_0(r)} 1 + \sum_{\{i :\, i \in I,\ r_i < 0\}} 1/N(y/t_i)$.

Since the last term is strictly monotone unless $\{i : i \in I,\ r_i < 0\}$
is empty, we see that there is no $r_i < 0$ for $i \in I$, and that
$I_0(\rho)$ and $I_0(r)$ contain the same number of points. Hence, the terms
corresponding to $(\sigma_i = \sigma,\ \rho_i = 0)$ and $(\sigma_i = \sigma,\ r_i = 0)$
cancel one another out in (3.2).

Next, let $\beta = \min\{\rho_i \tau_i : i \in I - I_0(\rho)\}$, and
$I_1(\rho) = \{i : \rho_i \tau_i = \beta,\ i \in I\}$, $I_1(r) = \{i : r_i t_i = \beta,\ i \in I\}$.
In (3.2), let $y = \beta x/\sigma + u$, divide by $(1/\sigma)\, n(x/\sigma)$ and let
$x \to \infty$, holding $u$ fixed. Then we obtain

(3.4)  $\sum_{i \in I_1(\rho)} N\!\left(\frac{u}{\sqrt{\tau_i^2 - \beta^2}}\right) = \sum_{i \in I_1(r)} N\!\left(\frac{u}{\sqrt{t_i^2 - \beta^2}}\right) + \sum_{\{i :\, r_i t_i < \beta,\ i \in I\}} 1$.

Letting $u \to -\infty$, we see that $\{i : r_i t_i < \beta,\ i \in I\}$ is empty.
Differentiating with respect to $u$ in (3.4) we get

(3.5)  $\sum_{i \in I_1(\rho)} \frac{1}{\sqrt{\tau_i^2 - \beta^2}}\, n\!\left(\frac{u}{\sqrt{\tau_i^2 - \beta^2}}\right) = \sum_{i \in I_1(r)} \frac{1}{\sqrt{t_i^2 - \beta^2}}\, n\!\left(\frac{u}{\sqrt{t_i^2 - \beta^2}}\right)$.

Let $\tau = \max\{\tau_i,\ t_j : i \in I_1(\rho),\ j \in I_1(r)\}$, divide by
$n\!\left(u/\sqrt{\tau^2 - \beta^2}\right)$ and let $u \to \infty$. We see that the number
of $\tau_i = \tau$ is the same as that of $t_i = \tau$, and that for the
corresponding $\rho_i$ and $r_i$, we have $\rho_i = \beta/\tau = r_i$.

Eliminating the terms with $\tau_i = t_i = \tau$ from (3.5), we can repeat
the process with the remaining largest value of the y-variance; and
continuing this way, we see that we have identified
$\{(\sigma_i, \rho_i, \tau_i),\ i \in I_1(\rho)\}$ with $\{(\sigma_i, r_i, t_i),\ i \in I_1(r)\}$
one-for-one. Eliminating these terms from (3.2), we can now repeat the
process with the next larger value of $\rho_i \tau_i$, $i \in I - I_0(\rho)$, and
continuing this way, we are able to identify
$\{(\sigma_i, \rho_i, \tau_i),\ i \in I - I_0(\rho)\}$ with
$\{(\sigma_i, r_i, t_i),\ i \in I - I_0(r)\}$ one-for-one. We are then left with an
equation similar to (3.2) but with a smaller number of terms and with
$\sigma_i < \sigma$. Iterating the procedure successively, we are thus able to
identify $\{(\sigma_i, \rho_i, \tau_i) : \rho_i > 0,\ i = 1, \dots, n\}$ with
$\{(\sigma_i, r_i, t_i) : r_i > 0,\ i = 1, \dots, n\}$ one-for-one.

Finally, removing these identical terms from both sides of (3.1),
we are left with products of one-dimensional normal cdf's in $x$ and $y$
with the same set of x-variances appearing on both sides and also the
same set of y-variances, but there is no unique matching of $(x, y)$
pairs among these.
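The lack of a unique matching among the zero-correlation terms can be seen directly: with zero means and zero correlations each bivariate cdf factors, and the factors can be re-paired freely. The sketch below uses hypothetical standard deviations and is an illustration, not part of the proof.

```python
import math

def N(z):
    """Standard normal cdf."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def product_cdf(x, y, pairs):
    """Product of zero-mean, zero-correlation bivariate normal cdfs.

    Each pair is (x standard deviation, y standard deviation).
    """
    value = 1.0
    for sd_x, sd_y in pairs:
        value *= N(x / sd_x) * N(y / sd_y)
    return value

# Same x standard deviations {1, 3} and y standard deviations {2, 0.5},
# paired in two different ways (hypothetical values).
pairing_1 = [(1.0, 2.0), (3.0, 0.5)]
pairing_2 = [(1.0, 0.5), (3.0, 2.0)]

points = [(-1.0, 0.3), (0.0, 0.0), (0.7, -2.0), (2.5, 1.5)]
gap = max(
    abs(product_cdf(x, y, pairing_1) - product_cdf(x, y, pairing_2))
    for x, y in points
)
print(gap)
```

This is why Theorem 3.1 asserts a one-for-one identification only for the components with positive correlation.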


4.  A Bivariate Mills' Ratio.

For the investigation in Section 5 we need a generalization of the
univariate Mills' ratio

(4.1)  $n(x)\left(\frac{1}{x} - \frac{1}{x^3}\right) < 1 - N(x) < n(x)\,\frac{1}{x}$,  $x > 0$,

where $n(\cdot)$ and $N(\cdot)$ are the standard normal pdf and cdf. (See, e.g.,
Feller [1], p. 175.)
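The univariate bounds can be verified numerically at a few points. This is a small sketch using only the standard library; the evaluation points are arbitrary, and the upper tail is computed with `math.erfc` so that it stays accurate far from the origin.

```python
import math

def n(x):
    """Standard normal pdf."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def upper_tail(x):
    """1 - N(x), computed with erfc to keep precision in the tail."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def mills_bounds(x):
    """Return (lower bound, 1 - N(x), upper bound) for x > 0."""
    lower = n(x) * (1.0 / x - 1.0 / x ** 3)
    upper = n(x) / x
    return lower, upper_tail(x), upper

checks = [mills_bounds(x) for x in (0.5, 1.0, 2.0, 5.0, 10.0)]
for lower, tail, upper in checks:
    print(lower, tail, upper)
```

For small $x$ the lower bound is negative or zero and therefore uninformative; both bounds become sharp as $x$ grows.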

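Theorem 4.1. Let

(4.2)  $\bar{\Phi}(x, y) = \iint_{u > x,\ v > y} \phi(u, v)\, du\, dv$,

where

(4.3)  $\phi(u, v) = \frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left[-\frac{u^2 - 2\rho u v + v^2}{2(1-\rho^2)}\right]$.

If $\rho > 0$, $A = (x - \rho y)/\sqrt{1-\rho^2} > 0$, $B = (y - \rho x)/\sqrt{1-\rho^2} > 0$, then

(4.4)  $\left[1 - \frac{1-\rho^2}{(x-\rho y)^2}\right]\left[1 - \frac{1-\rho^2}{(y-\rho x)^2}\right] < \frac{\bar{\Phi}(x, y)\,(x-\rho y)(y-\rho x)}{\phi(x, y)\,(1-\rho^2)^2} < 1$.

If $\rho \le 0$, $A > 0$, $B > 0$, then

(4.5)  $\left[1 - \frac{(1-\rho^2)(1-\rho)}{(x-\rho y)^2}\right]\left[1 - \frac{(1-\rho^2)(1-\rho)}{(y-\rho x)^2}\right] < \frac{\bar{\Phi}(x, y)\,(x-\rho y)(y-\rho x)}{\phi(x, y)\,(1-\rho^2)^2} < 1$.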
Proof:

(4.6)  $\bar{\Phi}(x, y) = \iint_{s, t > 0} \frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left[-\frac{(x+s)^2 - 2\rho(x+s)(y+t) + (y+t)^2}{2(1-\rho^2)}\right] ds\, dt$

$\quad = \phi(x, y) \iint_{s, t > 0} \exp\left[-\frac{s^2 - 2\rho s t + t^2}{2(1-\rho^2)}\right] \exp\left[-\frac{s(x-\rho y) + t(y-\rho x)}{1-\rho^2}\right] ds\, dt$

$\quad = \phi(x, y)\,(1-\rho^2)\, J(A, B)$,

where

(4.7)  $J(A, B) = \iint_{u, v > 0} \exp\left[-\frac{u^2 - 2\rho u v + v^2}{2}\right] \exp[-Au - Bv]\, du\, dv$.

We have (for $u \ge 0$, $v \ge 0$)

(4.8)  $(1-\rho)(u^2 + v^2) \le u^2 - 2\rho u v + v^2 \le u^2 + v^2$,  $\rho > 0$,

(4.9)  $u^2 - 2\rho u v + v^2 = u^2 + v^2$,  $\rho = 0$,

(4.10)  $u^2 + v^2 \le u^2 - 2\rho u v + v^2 \le (1-\rho)(u^2 + v^2)$,  $\rho < 0$.

Let

(4.11)  $J_a = \iint_{u, v > 0} \exp\left[-\frac{a^2(u^2 + v^2)}{2}\right] \exp(-Au - Bv)\, du\, dv$.

Then for $a^2 = 1 - \rho$

(4.12)  $J_1 \le J(A, B) \le J_a$,  $\rho > 0$,

(4.13)  $J(A, B) = J_1$,  $\rho = 0$,

(4.14)  $J_a \le J(A, B) \le J_1$,  $\rho < 0$.

Note that $J_a$ is the product of two terms of the type

(4.15)  $\int_{u > 0} \exp\left[-\frac{a^2 u^2 + 2 c u}{2}\right] du$,

where $c = A$ or $B$. The expression (4.15) equals

(4.16)  $\exp\left[\frac{c^2}{2a^2}\right] \int_{u > 0} \exp\left[-\frac{(a u + c/a)^2}{2}\right] du = \exp\left[\frac{c^2}{2a^2}\right] \frac{\sqrt{2\pi}}{a} \int_{c/a}^{\infty} \frac{1}{\sqrt{2\pi}} \exp\left(-\frac{v^2}{2}\right) dv$.

If $c > 0$, this is bounded above and below, respectively, by $1/c$ and
$(1/c)(1 - a^2/c^2)$ according to (4.1). For $\rho = 0$ the theorem follows
directly from (4.1) because $\bar{\Phi}(x, y) = [1 - N(x)][1 - N(y)]$. For
$\rho > 0$ and $\rho < 0$ the theorem follows from (4.6), (4.12), and (4.14).
Q.E.D.
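The one-dimensional step of the proof, namely the closed form (4.16) and its bounds $1/c$ and $(1/c)(1 - a^2/c^2)$, can be checked numerically. The sketch below is only an illustration: the values of $a$ and $c$ are hypothetical, and the midpoint rule is a crude stand-in for the exact integral (4.15).

```python
import math

def one_dim_factor(a, c):
    """Closed form (4.16) for the integral of exp(-(a^2 u^2 + 2 c u)/2) over u > 0."""
    tail = 0.5 * math.erfc(c / (a * math.sqrt(2.0)))  # upper normal tail at c/a
    return math.exp(c * c / (2.0 * a * a)) * math.sqrt(2.0 * math.pi) / a * tail

def numeric_factor(a, c, steps=20000, upper=20.0):
    """Midpoint-rule approximation of the same integral (illustrative only)."""
    h = upper / steps
    total = 0.0
    for k in range(steps):
        u = h * (k + 0.5)
        total += math.exp(-0.5 * (a * a * u * u + 2.0 * c * u))
    return h * total

cases = [(1.0, 2.0), (0.8, 3.0), (1.2, 1.5)]  # hypothetical (a, c) with c > 0
results = []
for a, c in cases:
    exact = one_dim_factor(a, c)
    approx = numeric_factor(a, c)
    lower = (1.0 / c) * (1.0 - a * a / (c * c))
    upper = 1.0 / c
    results.append((lower, exact, approx, upper))
    print(lower, exact, approx, upper)
```

Multiplying two such factors, one with $c = A$ and one with $c = B$, gives the bounds on $J_a$ that enter (4.12) and (4.14).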

Remarks. Notice that the conditions $A > 0$, $B > 0$ for $x > 0$,
$y > 0$ correspond to a point $(x, y)$ in the first quadrant that is above
the regression line (Y on X) and to the right of the other (X on Y).
If $\rho \le 0$, then $A > 0$ and $B > 0$ are automatically satisfied if
$x, y > 0$.

It is of interest to compare Theorem 4.1 with the bound of Harkness
and Godambe ([2]). The principal term in our estimate (which is also
our upper bound) is

(4.17)  $\frac{\phi(x, y)\,(1-\rho^2)^2}{(x-\rho y)(y-\rho x)}$,

subject to the assumption that $x - \rho y,\ y - \rho x > 0$ (which is a
restriction only if $\rho > 0$). Since (4.17) is symmetric in $(x, y)$, let
$y = cx$, with $1 \le c < 1/\rho$ if $\rho > 0$. Then (4.17) is

(4.18)  $\frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left[-\frac{x^2(1 - 2\rho c + c^2)}{2(1-\rho^2)}\right] \frac{(1-\rho^2)^2}{x^2 (1-\rho c)(c-\rho)}$.

In the Harkness and Godambe estimates, the principal terms in the upper
and lower bounds are, respectively,

(4.19)  $\frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left[-\frac{x^2(1 - 2\rho + 1)}{2(1-\rho^2)}\right] \frac{(1+\rho)^2}{x^2} = \frac{1}{2\pi\sqrt{1-\rho^2}}\, \frac{(1+\rho)^2}{x^2} \exp\left[-\frac{x^2\, 2(1-\rho)}{2(1-\rho^2)}\right]$

and

(4.20)  $\frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left[-\frac{x^2 c^2 (1 - 2\rho + 1)}{2(1-\rho^2)}\right] \frac{(1+\rho)^2}{c^2 x^2} = \frac{1}{2\pi\sqrt{1-\rho^2}}\, \frac{(1+\rho)^2}{c^2 x^2} \exp\left[-\frac{x^2 c^2\, 2(1-\rho)}{2(1-\rho^2)}\right]$.

Now, (4.18) = (4.19) = (4.20) if $c = 1$, but for $1 < c < 1/\rho$,
(4.18)/(4.19) $\to 0$ and (4.20)/(4.18) $\to 0$ as $x \to \infty$.

Finally, if $\rho > 0$ and $y > x/\rho$, we can still get quite good
bounds from (4.4) by using $\bar{\Phi}(x', y) \le \bar{\Phi}(x, y) \le \bar{\Phi}(x, y')$
where $x < x' < y$ and $x < y' < y$, which is better than using
$\bar{\Phi}(y, y) \le \bar{\Phi}(x, y) \le \bar{\Phi}(x, x)$. Since the denominator in
the last factor in (4.18) is maximized by choosing $c = (\rho + 1/\rho)/2$,
$\bar{\Phi}(x, y')$, with $y' = [(\rho^2 + 1)/(2\rho)]\,x$, might be a reasonably
good upper bound for $\bar{\Phi}(x, y)$ when $y > x/\rho$, $\rho > 0$; similarly,
$\bar{\Phi}(x', y)$, with $x' = [2\rho/(\rho^2 + 1)]\,y$, might be a reasonably good
lower bound. In the case of a general bivariate normal distribution, we
have

(4.21)  $\int_{a}^{\infty}\!\int_{b}^{\infty} \frac{1}{2\pi\sigma\tau\sqrt{1-\rho^2}} \exp\left\{-\frac{1}{2(1-\rho^2)}\left[\frac{(s-\mu)^2}{\sigma^2} - \frac{2\rho(s-\mu)(t-\nu)}{\sigma\tau} + \frac{(t-\nu)^2}{\tau^2}\right]\right\} dt\, ds$

$\quad = \int_{(a-\mu)/\sigma}^{\infty}\!\int_{(b-\nu)/\tau}^{\infty} \phi(u, v)\, dv\, du = \bar{\Phi}\!\left(\frac{a-\mu}{\sigma},\ \frac{b-\nu}{\tau}\right)$.

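Corollary 4.1. Let (4.21) be denoted by $\bar{\Phi}(a, b;\ \mu, \nu, \sigma, \tau, \rho)$,
let the integrand be denoted by $\phi(s, t;\ \mu, \nu, \sigma, \tau, \rho)$, and let

(4.22)  $A = \left(\frac{a-\mu}{\sigma} - \rho\,\frac{b-\nu}{\tau}\right)\Big/\sqrt{1-\rho^2}$,  $B = \left(\frac{b-\nu}{\tau} - \rho\,\frac{a-\mu}{\sigma}\right)\Big/\sqrt{1-\rho^2}$.

If $\rho > 0$, $A > 0$, $B > 0$, then

(4.23)  $\left[1 - \frac{1}{A^2}\right]\left[1 - \frac{1}{B^2}\right] < \frac{\bar{\Phi}(a, b;\ \mu, \nu, \sigma, \tau, \rho)\, A B}{\sigma\tau\,\phi(a, b;\ \mu, \nu, \sigma, \tau, \rho)\,(1-\rho^2)} < 1$.

If $\rho \le 0$, $A > 0$, $B > 0$, then

(4.24)  $\left[1 - \frac{1-\rho}{A^2}\right]\left[1 - \frac{1-\rho}{B^2}\right] < \frac{\bar{\Phi}(a, b;\ \mu, \nu, \sigma, \tau, \rho)\, A B}{\sigma\tau\,\phi(a, b;\ \mu, \nu, \sigma, \tau, \rho)\,(1-\rho^2)} < 1$.

Corollary 4.2. Let $y - \nu = c\,(x - \mu)$, where $c$ is a positive
constant. Then, as $x \to \infty$, we have

(4.25)  $\frac{\bar{\Phi}(x, y;\ \mu, \nu, \sigma, \tau, \rho)\,(x-\mu)^2 (\tau - c\rho\sigma)(c\sigma - \rho\tau)}{\sigma^3 \tau^3\, \phi(x, y;\ \mu, \nu, \sigma, \tau, \rho)\,(1-\rho^2)^2} \to 1$

for $c > 0$ if $\rho \le 0$ and for $c \in (\rho\tau/\sigma,\ \tau/[\rho\sigma])$ if $\rho > 0$.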
5.  General Case of Two Multivariate Normal Distributions.

As was explained in Section 3, the solution of identifiability for
the multivariate normal case depends on the solution for the bivariate
normal case. We treat the case of $n = 2$ in terms of the notation of
Section 3.

Theorem 5.1. If $\Phi_1$, $\Phi_2$, $F_1$, $F_2$ are bivariate normal cdf's such
that

(5.1)  $\Phi_1(x, y)\, \Phi_2(x, y) = F_1(x, y)\, F_2(x, y)$,

then one of the following relations holds:

(i)  $\Phi_1 = F_1$ and $\Phi_2 = F_2$, or

(ii)  $\Phi_1 = F_2$ and $\Phi_2 = F_1$, or

(iii)  $\Phi_i(x, y) = A_i(x)\, B_i(y)$, $i = 1, 2$, and
$F_1(x, y) = A_1(x)\, B_2(y)$, $F_2(x, y) = A_2(x)\, B_1(y)$.


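Proof. We shall give a detailed proof under the assumption that the
means are all zero. The proof for the general case of arbitrary means
is not different in spirit, but only involves additional tedious
computations of the same kind. As in Section 3, we shall assume the
parameters to be $(\sigma_i^2, \rho_i, \tau_i^2)$, $i = 1, 2$, on the lhs of (5.1) and
$(\sigma_i^2, r_i, t_i^2)$, $i = 1, 2$, on the rhs. From (5.1) we obtain

(5.2)  $g(x, y) = \frac{\partial^2}{\partial x\, \partial y}\left[\Phi_1(x, y)\, \Phi_2(x, y)\right]$

$\quad = \phi_1(x, y)\, \Phi_2(x, y)$

$\qquad + \frac{1}{\sigma_1 \tau_2}\, n\!\left(\frac{x}{\sigma_1}\right) n\!\left(\frac{y}{\tau_2}\right) N\!\left[\frac{y/\tau_1 - \rho_1 x/\sigma_1}{\sqrt{1-\rho_1^2}}\right] N\!\left[\frac{x/\sigma_2 - \rho_2 y/\tau_2}{\sqrt{1-\rho_2^2}}\right]$

$\qquad + \frac{1}{\tau_1 \sigma_2}\, n\!\left(\frac{y}{\tau_1}\right) n\!\left(\frac{x}{\sigma_2}\right) N\!\left[\frac{x/\sigma_1 - \rho_1 y/\tau_1}{\sqrt{1-\rho_1^2}}\right] N\!\left[\frac{y/\tau_2 - \rho_2 x/\sigma_2}{\sqrt{1-\rho_2^2}}\right]$

$\qquad + \Phi_1(x, y)\, \phi_2(x, y)$,

where $\phi_i$ is the density of $\Phi_i$ and $n(\cdot)$ and $N(\cdot)$ are univariate
standard normal pdf and cdf, respectively. Also,

(5.3)  $f(x, y) = \frac{\partial^2}{\partial x\, \partial y}\left[F_1(x, y)\, F_2(x, y)\right]$.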

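I. First consider the simple case $\sigma_1 = \sigma_2$, $\tau_1 = \tau_2$. By
consideration of the marginal distributions, $F_1$ and $F_2$ have the same
common x-variance, and $\tau_1 = \tau_2 = t_1 = t_2$, so that the problem can be
reduced to standard form by scale transformations on $x$ and $y$. In this
case (5.2) and (5.3) give us

(5.4)  $g_\rho(x, y) = \frac{1}{2\pi\sqrt{1-\rho_1^2}} \exp\left[-\frac{x^2 - 2\rho_1 x y + y^2}{2(1-\rho_1^2)}\right] \Phi_2(x, y)$

$\qquad + n(x)\, n(y)\, N\!\left[\frac{x - \rho_1 y}{\sqrt{1-\rho_1^2}}\right] N\!\left[\frac{y - \rho_2 x}{\sqrt{1-\rho_2^2}}\right]$

$\qquad + n(x)\, n(y)\, N\!\left[\frac{y - \rho_1 x}{\sqrt{1-\rho_1^2}}\right] N\!\left[\frac{x - \rho_2 y}{\sqrt{1-\rho_2^2}}\right]$

$\qquad + \Phi_1(x, y)\, \frac{1}{2\pi\sqrt{1-\rho_2^2}} \exp\left[-\frac{x^2 - 2\rho_2 x y + y^2}{2(1-\rho_2^2)}\right] = g_r(x, y)$,

where $g_r$ denotes the same expression with $(r_1, r_2, F_1, F_2)$ in place
of $(\rho_1, \rho_2, \Phi_1, \Phi_2)$. On putting $y = x$, we have

(5.5)  $\frac{g_\rho(x, x)}{n^2(x)} = \frac{1}{\sqrt{1-\rho_1^2}} \exp\left[\frac{\rho_1}{1+\rho_1}\, x^2\right] \Phi_2(x, x) + 2\, N\!\left[\sqrt{\frac{1-\rho_1}{1+\rho_1}}\, x\right] N\!\left[\sqrt{\frac{1-\rho_2}{1+\rho_2}}\, x\right] + \frac{1}{\sqrt{1-\rho_2^2}} \exp\left[\frac{\rho_2}{1+\rho_2}\, x^2\right] \Phi_1(x, x)$.

Suppose $\max(\rho_1, \rho_2, r_1, r_2) = \rho_1 > 0$ and let $r_1 \ge r_2$. Then as
$x \to \infty$

(5.6)  $\frac{g_\rho(x, x)}{n^2(x)}\, \sqrt{1-\rho_1^2}\, \exp\left[-\frac{\rho_1}{1+\rho_1}\, x^2\right] \to
\begin{cases} 1 & \text{if } \rho_1 > \rho_2, \\ 2 & \text{if } \rho_1 = \rho_2. \end{cases}$

On the other hand

(5.7)  $\frac{g_r(x, x)}{n^2(x)}\, \sqrt{1-\rho_1^2}\, \exp\left[-\frac{\rho_1}{1+\rho_1}\, x^2\right] \to
\begin{cases} 0 & \text{if } \rho_1 > r_1 \ge r_2, \\ 1 & \text{if } \rho_1 = r_1 > r_2, \\ 2 & \text{if } \rho_1 = r_1 = r_2. \end{cases}$

Hence, if $\rho_1 = \rho_2$, then $r_1 = r_2 = \rho_1 = \rho_2$ and $\Phi_1 = \Phi_2 = F_1 = F_2$.
If $\rho_1 > \rho_2$, then $r_1 = \rho_1$ and $r_2 = \rho_2$; thus $\Phi_1 = F_1$ and $\Phi_2 = F_2$.

Now suppose $\max(\rho_1, \rho_2, r_1, r_2) = \rho_1 = 0$. If $\rho_2 = 0$,
(5.5) $\to 4$; then $g_r(x, x)/n^2(x) \to 4$, which implies $r_1 = r_2 = 0$. If
$\rho_2 < 0$, (5.5) $\to 3$; then $g_r(x, x)/n^2(x) \to 3$, which implies
$r_1 = 0 > r_2 = \rho_2$.

Now suppose $\max(\rho_1, \rho_2, r_1, r_2) = \rho_1 < 0$. Then (5.5) $\to 2$.
Consider

(5.8)  $\frac{g_\rho(x, x)}{n^2(x)} - 2 = \frac{1}{\sqrt{1-\rho_1^2}} \exp\left[\frac{\rho_1}{1+\rho_1}\, x^2\right] \Phi_2(x, x) + \frac{1}{\sqrt{1-\rho_2^2}} \exp\left[\frac{\rho_2}{1+\rho_2}\, x^2\right] \Phi_1(x, x) - 2\left\{1 - N\!\left[\sqrt{\frac{1-\rho_1}{1+\rho_1}}\, x\right] N\!\left[\sqrt{\frac{1-\rho_2}{1+\rho_2}}\, x\right]\right\}$.

By Mills' ratio, for $i = 1, 2$ and $x > 0$,

(5.9)  $1 - N\!\left[\sqrt{\frac{1-\rho_i}{1+\rho_i}}\, x\right] < \sqrt{\frac{1+\rho_i}{1-\rho_i}}\ \frac{1}{x}\ n\!\left[\sqrt{\frac{1-\rho_i}{1+\rho_i}}\, x\right]$,

so that the last term in (5.8) is of smaller order than
$\exp[\rho_1 x^2/(1+\rho_1)]$. Then as $x \to \infty$

(5.10)  $\left[\frac{g_\rho(x, x)}{n^2(x)} - 2\right] \sqrt{1-\rho_1^2}\, \exp\left[-\frac{\rho_1}{1+\rho_1}\, x^2\right] \to
\begin{cases} 1 & \text{if } \rho_1 > \rho_2, \\ 2 & \text{if } \rho_1 = \rho_2. \end{cases}$

Similarly as $x \to \infty$

(5.11)  $\left[\frac{g_r(x, x)}{n^2(x)} - 2\right] \sqrt{1-\rho_1^2}\, \exp\left[-\frac{\rho_1}{1+\rho_1}\, x^2\right] \to
\begin{cases} 0 & \text{if } \rho_1 > r_1, \\ 1 & \text{if } r_1 = \rho_1 > r_2, \\ 2 & \text{if } r_1 = r_2 = \rho_1. \end{cases}$

We obtain identification.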

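II. Having disposed of the case $\sigma_1 = \sigma_2$, $\tau_1 = \tau_2$, we now consider
the situation where there is at least one inequality. Without loss of
generality, we may suppose $\sigma_1 > \sigma_2$. Then in Equation (5.2), if we keep
$y$ fixed and let $x \to \infty$, we have

(5.12)  $\frac{g(x, y)\, \sigma_1}{n(x/\sigma_1)} \to
\begin{cases}
0 & \text{if } \rho_1 > 0, \\
\frac{1}{\tau_1}\, n\!\left(\frac{y}{\tau_1}\right) N\!\left(\frac{y}{\tau_2}\right) + \frac{1}{\tau_2}\, n\!\left(\frac{y}{\tau_2}\right) N\!\left(\frac{y}{\tau_1}\right) & \text{if } \rho_1 = 0, \\
\frac{1}{\tau_2}\, n\!\left(\frac{y}{\tau_2}\right) & \text{if } \rho_1 < 0.
\end{cases}$

In the same way, from (5.3) we get

(5.13)  $\frac{f(x, y)\, \sigma_1}{n(x/\sigma_1)} \to
\begin{cases}
0 & \text{if } r_1 > 0, \\
\frac{1}{t_1}\, n\!\left(\frac{y}{t_1}\right) N\!\left(\frac{y}{t_2}\right) + \frac{1}{t_2}\, n\!\left(\frac{y}{t_2}\right) N\!\left(\frac{y}{t_1}\right) & \text{if } r_1 = 0, \\
\frac{1}{t_2}\, n\!\left(\frac{y}{t_2}\right) & \text{if } r_1 < 0.
\end{cases}$

Consequently, from (5.1) we see that

($\alpha$)  $\rho_1 > 0 \Rightarrow r_1 > 0$,

($\beta$)  $\rho_1 = 0 \Rightarrow r_1 = 0$,

($\gamma$)  $\rho_1 < 0 \Rightarrow r_1 < 0$, $t_i = \tau_i$, $i = 1, 2$.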

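In Case ($\alpha$), let $y = \gamma x$, where $\gamma = \rho_1 \tau_1/\sigma_1$, and $x \to \infty$; then,
noting that $1/\sigma^2 - 2\rho\gamma/(\sigma\tau) + \gamma^2/\tau^2 = (1-\rho^2)/\sigma^2 + (\rho/\sigma - \gamma/\tau)^2$,
from (5.2) we obtain

(5.14)  $g(x, \gamma x) \sim \frac{1}{\sigma_1 \tau_1 \sqrt{2\pi(1-\rho_1^2)}}\ n\!\left(\frac{x}{\sigma_1}\right)$.

On the other hand, looking at the expression for (5.3) similar to that
for (5.2), we observe that the density $f_1$ of $F_1$ satisfies

(5.15)  $f_1(x, \gamma x) = \frac{1}{\sigma_1 t_1 \sqrt{2\pi(1-r_1^2)}}\ n\!\left(\frac{x}{\sigma_1}\right) \exp\left[-\frac{\left(\frac{r_1}{\sigma_1} - \frac{\gamma}{t_1}\right)^2 x^2}{2(1-r_1^2)}\right]$,

so that

(5.16)  $\frac{f(x, \gamma x)\ \sigma_1 \tau_1 \sqrt{2\pi(1-\rho_1^2)}}{n(x/\sigma_1)} \to
\begin{cases}
0 & \text{if } \frac{r_1}{\sigma_1} \ne \frac{\gamma}{t_1}, \\
\frac{\tau_1 \sqrt{1-\rho_1^2}}{t_1 \sqrt{1-r_1^2}} & \text{if } \frac{r_1}{\sigma_1} = \frac{\gamma}{t_1}.
\end{cases}$

Hence, on account of (5.1), we conclude that

(5.17)  $\frac{r_1}{\sigma_1} = \frac{\gamma}{t_1}$  and  $t_1^2(1 - r_1^2) = \tau_1^2(1 - \rho_1^2)$.

This implies $t_1 = \tau_1$ and $r_1 = \rho_1$. So, in Case ($\alpha$), $\Phi_1 = F_1$ and
hence $\Phi_2 = F_2$.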
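The two algebraic facts used in Case ($\alpha$) can be checked with a few lines of arithmetic. This sketch is an illustration with hypothetical parameter values; it reads the first relation in (5.17) as $r_1 t_1 = \rho_1 \tau_1$ (equivalent to $r_1/\sigma_1 = \gamma/t_1$ with $\gamma = \rho_1\tau_1/\sigma_1$), which is an assumption about the garbled display, and the helper names are not from the paper.

```python
import math

def quad_form(sigma, tau, rho, gamma):
    """1/sigma^2 - 2 rho gamma/(sigma tau) + gamma^2/tau^2."""
    return 1.0 / sigma**2 - 2.0 * rho * gamma / (sigma * tau) + gamma**2 / tau**2

def completed_square(sigma, tau, rho, gamma):
    """(1 - rho^2)/sigma^2 + (rho/sigma - gamma/tau)^2."""
    return (1.0 - rho**2) / sigma**2 + (rho / sigma - gamma / tau) ** 2

def solve_517(beta, d):
    """Solve r*t = beta and t^2 (1 - r^2) = d for (t, r) with t > 0."""
    t = math.sqrt(d + beta * beta)  # t^2 = t^2 (1 - r^2) + (r t)^2
    return t, beta / t

# Hypothetical parameters of Phi_1.
sigma_1, tau_1, rho_1 = 2.0, 1.5, 0.4
gamma = rho_1 * tau_1 / sigma_1

lhs = quad_form(sigma_1, tau_1, rho_1, gamma)
rhs = completed_square(sigma_1, tau_1, rho_1, gamma)

# Given the two quantities that (5.17) pins down, (t_1, r_1) is unique.
t_1, r_1 = solve_517(rho_1 * tau_1, tau_1**2 * (1.0 - rho_1**2))
print(lhs, rhs, t_1, r_1)
```

With $\gamma = \rho_1\tau_1/\sigma_1$ the squared term vanishes for $\Phi_1$, which is why the exponent in (5.14) reduces to that of $n(x/\sigma_1)$.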

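Next, in Case ($\beta$), the original equation (5.1) becomes

(5.18)  $N\!\left(\frac{x}{\sigma_1}\right) N\!\left(\frac{y}{\tau_1}\right) \Phi_2(x, y) = N\!\left(\frac{x}{\sigma_1}\right) N\!\left(\frac{y}{t_1}\right) F_2(x, y)$.

If we now remove the common factor $N(x/\sigma_1)$ from both sides,
differentiate with respect to $x$ and remove the common factor
$(1/\sigma_2)\, n(x/\sigma_2)$ from both sides, we are left with

(5.19)  $N\!\left(\frac{y}{\tau_1}\right) N\!\left[\frac{y/\tau_2 - \rho_2 x/\sigma_2}{\sqrt{1-\rho_2^2}}\right] = N\!\left(\frac{y}{t_1}\right) N\!\left[\frac{y/t_2 - r_2 x/\sigma_2}{\sqrt{1-r_2^2}}\right]$.

If $\rho_2 = 0$, the lhs of (5.19) is independent of $x$, and hence $r_2 = 0$;
in this case, both sides of (5.1) are products of univariate cdf's, and
there is no unique matching of $(x, y)$ pairs. On the other hand, if
$\rho_2 \ne 0$, then $r_2 \ne 0$; and differentiating (5.19) with respect to $x$,
we have

(5.20)  $N\!\left(\frac{y}{\tau_1}\right) n\!\left[\frac{y/\tau_2 - \rho_2 x/\sigma_2}{\sqrt{1-\rho_2^2}}\right] \frac{\rho_2}{\sqrt{1-\rho_2^2}} = N\!\left(\frac{y}{t_1}\right) n\!\left[\frac{y/t_2 - r_2 x/\sigma_2}{\sqrt{1-r_2^2}}\right] \frac{r_2}{\sqrt{1-r_2^2}}$.

Setting $y = 0$ yields $\rho_2 = r_2$, and with arbitrary $y$ putting $x = 0$
gives $t_2 = \tau_2$. Hence, $\Phi_1 = F_1$ and $\Phi_2 = F_2$.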

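Finally, in Case ($\gamma$), $\rho_1 < 0$, $r_1 < 0$, $t_i = \tau_i$, $i = 1, 2$. If,
in (5.1), we take $x, y < 0$ and use the fact that $\Phi(x, y) = \bar{\Phi}(-x, -y)$
for a normal cdf with zero means, we obtain

(5.21)  $\prod_{i=1}^{2} \bar{\Phi}_i(x, y) = \prod_{i=1}^{2} \bar{F}_i(x, y)$,  $x, y > 0$.

Now, if we set $y = cx$ and let $x \to \infty$, we see from Corollary 4.2 that
the asymptotic relation (4.25) holds for $\bar{\Phi}_1$ and $\bar{F}_1$ for all $c > 0$
and also holds for $\bar{\Phi}_2$ and $\bar{F}_2$ at least for all $c$ in an interval of
positive length containing the point $\tau_2/\sigma_2$. Hence, we have

(5.22)  $\prod_{i=1}^{2} \frac{\phi_i(x, cx)\,(1-\rho_i^2)^2\,(\tau_i - c r_i \sigma_i)(c\sigma_i - r_i \tau_i)}{f_i(x, cx)\,(1-r_i^2)^2\,(\tau_i - c\rho_i \sigma_i)(c\sigma_i - \rho_i \tau_i)} \to 1$

as $x \to \infty$, for all $c$ in an interval of positive length containing the
point $\tau_2/\sigma_2$. But

(5.23)  $\prod_{i=1}^{2} \frac{\phi_i(x, cx)}{f_i(x, cx)} = \prod_{i=1}^{2} \left(\frac{1-r_i^2}{1-\rho_i^2}\right)^{1/2} \exp\left[-\tfrac{1}{2}\, x^2\, Q(c)\right]$,

where $Q(c)$ is a quadratic polynomial, and (5.22) implies that the rhs
of (5.23) has a finite positive limit for all $c$ in an interval of
positive length. This can happen only if $Q(c) \equiv 0$; the limit is then
$\prod_{i=1}^{2} \sqrt{(1-r_i^2)/(1-\rho_i^2)}$. Thus we have

(5.24)  $\sum_{i=1}^{2} \left[\frac{1}{\sigma_i^2} - \frac{2\rho_i c}{\sigma_i \tau_i} + \frac{c^2}{\tau_i^2}\right] (1-\rho_i^2)^{-1} = \sum_{i=1}^{2} \left[\frac{1}{\sigma_i^2} - \frac{2 r_i c}{\sigma_i \tau_i} + \frac{c^2}{\tau_i^2}\right] (1-r_i^2)^{-1}$

and

(5.25)  $\prod_{i=1}^{2} (1-\rho_i^2)^{-3/2} (\tau_i - c\rho_i \sigma_i)(c\sigma_i - \rho_i \tau_i) = \prod_{i=1}^{2} (1-r_i^2)^{-3/2} (\tau_i - c r_i \sigma_i)(c\sigma_i - r_i \tau_i)$,

both relations holding for all $c$ in an interval of positive length.
Consequently, from (5.24) we obtain

(5.26)  $\frac{1}{\sigma_1^2}\left(\frac{1}{1-\rho_1^2} - \frac{1}{1-r_1^2}\right) + \frac{1}{\sigma_2^2}\left(\frac{1}{1-\rho_2^2} - \frac{1}{1-r_2^2}\right) = 0$,

(5.27)  $\frac{1}{\tau_1^2}\left(\frac{1}{1-\rho_1^2} - \frac{1}{1-r_1^2}\right) + \frac{1}{\tau_2^2}\left(\frac{1}{1-\rho_2^2} - \frac{1}{1-r_2^2}\right) = 0$,

(5.28)  $\frac{1}{\sigma_1 \tau_1}\left(\frac{\rho_1}{1-\rho_1^2} - \frac{r_1}{1-r_1^2}\right) + \frac{1}{\sigma_2 \tau_2}\left(\frac{\rho_2}{1-\rho_2^2} - \frac{r_2}{1-r_2^2}\right) = 0$.

If $r_1 = \rho_1$, then $\Phi_1 = F_1$, and hence $\Phi_2 = F_2$. So, it remains only
to investigate the possibility $r_1 \ne \rho_1$. In this case, (5.26) $\Rightarrow$
$r_2 \ne \rho_2$, and from (5.26) and (5.27) we have

(5.29)  $\frac{\tau_1}{\sigma_1} = \frac{\tau_2}{\sigma_2} = \tau$, say.

But from (5.25) we know that the polynomials in $c$ on the two sides of
the equation have the same zeros. The zeros of the lhs are
$\{\tau/\rho_1,\ \tau\rho_1,\ 0\}$ if $\rho_2 = 0$ and $\{\tau\rho_1,\ \tau/\rho_1,\ \tau\rho_2,\ \tau/\rho_2\}$ if
$\rho_2 \ne 0$, and of the rhs $\{\tau/r_1,\ \tau r_1,\ 0\}$ if $r_2 = 0$ and
$\{\tau r_1,\ \tau/r_1,\ \tau r_2,\ \tau/r_2\}$ if $r_2 \ne 0$.

Hence, the assumption that $r_1 \ne \rho_1 < 0$ leads to the conclusion $r_1 = \rho_2$
and $r_2 = \rho_1$. This, together with (5.26), contradicts the assumption that
$\sigma_1 > \sigma_2$. Thus in Case ($\gamma$) also, we must have $r_1 = \rho_1$, $r_2 = \rho_2$, so that
$\Phi_1 = F_1$, $\Phi_2 = F_2$. Q.E.D.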